The management and mining of multiple predictive models using the predictive modeling markup language

نویسندگان

  • Robert L. Grossman
  • Stuart Bailey
  • Ashok Ramu
  • Balinder Malhi
  • Philip Hallstrom
  • Ivan Pulleyn
  • Xiao Qin
چکیده

We introduce a markup language based upon XML for working with the predictive models produced by data mining systems. The language is called the Predictive Model Markup Language (PMML) and can be used to define predictive models and ensembles of predictive models. It provides a flexible mechanism for defining schema for predictive models and supports model selection and model averaging involving multiple predictive models. It has proved useful for applications requiring ensemble learning, partitioned learning, and distributed learning. In addition, it facilitates moving predictive models across applications and systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A KDDSE-independent PMML Visualizer

Several knowledge discovery support engines (KDDSE) feature the export and in a few cases even the import of data mining models in the Predictive Modeling Markup Language (PMML) standard. A visualization tool for PMML models that is independent of a specific KDDSE is presented in this paper. An extension of the PMML model for association rules that allows the definition of propositional and fir...

متن کامل

Exchanging Data Mining Models with the Predictive Modelling Markup Language

The aim of the Predictive Model Markup Language (PMML) is to support the exchange of data mining models between different applications and visualization tools. It is the result of a standardization effort by a group of vendors. PMML is an XML-based language (grammar) for describing data mining models. Despite its name, it is not limited to predictive models. The contribution of this paper is tw...

متن کامل

A Proposed Model to Identify Factors Affecting Asthma using Data Mining

Introduction: The identification of asthma risk factors plays an important role in the prevention of the asthma as well as reducing the severity of symptoms. Nowadays, the identification process can be performed using modern techniques. Data mining is one of the techniques which has many applications in the fields of diagnosis, prediction, and treatment. This study aimed to identify the effecti...

متن کامل

Predicting Bankruptcy of Companies using Data Mining Models and Comparing the Results with Z Altman Model

One of the issues helping make investment decisions is appropriate tools and models to evaluate financial situation 0f the organization.  By means of these tools, investors can analyze financial situation of the organization and identify financial distress or an ideal condition, they become aware of making decisions to invest in appropriate conditions.  The main objective of this study is to ev...

متن کامل

Trip pattern of low-density residential area in semi urban industrial cluster: predictive modeling

This research elucidates the trip pattern of the low-density residential zone in a semi-urban industrial cluster of southwestern Nigeria. These sets of dwellers are often times neglected in the transportation planning process with the view that it is not a residential zone. Domiciliary information gathering procedure was employed in the analysis with 0.82 return rates. It was backed up with the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Information & Software Technology

دوره 41  شماره 

صفحات  -

تاریخ انتشار 1999